We present a search for the standard model Higgs boson in 9.5 fb$^{-1}$ of
$p\bar{p}$ collisions at $\sqrt{s} = 1.96$ TeV collected with the D0 detector at
the Fermilab Tevatron Collider. The final state considered contains a pair
of $b$ jets and is characterized by an imbalance in transverse energy, as
expected from $p\bar{p} \to ZH \to \nu\bar{\nu}b\bar{b}$ production. The search
is also sensitive to the $WH \to \ell\nu b\bar{b}$ channel when the charged
lepton is not identified. The data are found to be in good agreement with
the expected background. For a Higgs boson mass of 125 GeV, we set a limit
at the 95% C.L. on the cross section $\sigma(p\bar{p} \to [Z/W]H)$, assuming
standard model branching fractions, that is a factor of 4.3 larger than the
theoretical standard model value, while the expected factor is 3.9. The
search is also used to measure a combined WZ and ZZ production cross
section that is a factor of 0.94 ± 0.31 (stat) ± 0.34 (syst) times the
standard model prediction of 4.4 pb, with an observed significance of 2.0
standard deviations.

PACS numbers: 13.85.Qk, 13.85.Ni, 13.85.Rm, 14.80.Bn 



INTRODUCTION 



In the standard model (SM) electroweak symmetry breaking is achieved via
the introduction of a doublet of scalar fields, of which one degree of
freedom remains once the W and Z vector bosons have acquired their masses.
This degree of freedom manifests itself as a new scalar particle [?], the
Higgs boson (H). Associated ZH production in $p\bar{p}$ collisions at
$\sqrt{s} = 1.96$ TeV, with $Z \to \nu\bar{\nu}$ and $H \to b\bar{b}$, is among
the most sensitive processes in the search for a Higgs boson with mass
$M_H < 135$ GeV at the Fermilab Tevatron Collider [?]. The D0 Collaboration
published a search for this process based on 5.2 fb$^{-1}$ of integrated
luminosity [?]. In this Letter, an extension of this search to the full
Run II dataset is presented. The CDF Collaboration recently reported
results from a similar search, as did the ATLAS and CMS Collaborations
using $pp$ collisions at 7 TeV at the LHC [?]. A lower limit of 114.4 GeV
was set on $M_H$ by the LEP Collaborations [?], while an upper limit at
127 GeV has been established by the ATLAS and CMS Collaborations [13].
These limits and those given below are all defined at the 95% C.L. The
ATLAS and CMS Collaborations have also published [?, 13] excesses above
background expectations at approximately 125 GeV and have recently
reported results confirming these excesses at the five standard deviations
level [11, 12].

The final-state topology considered in this search consists of a pair of
$b$ jets from $H \to b\bar{b}$ decay and missing transverse energy
($\not\!\!E_T$) from $Z \to \nu\bar{\nu}$. The search is also sensitive to the
WH process when the charged lepton from $W \to \ell\nu$ decay is not
identified. The main backgrounds arise from (W/Z)+heavy-flavor jets (jets
initiated by $b$ or $c$ quarks), top quark production, and multijet (MJ)
events with $\not\!\!E_T$ arising from mismeasurement of jet energies. A
boosted-decision-tree discriminant based on kinematic properties is first
used to reject most of the multijet events. Next, jets from candidate
Higgs boson decays are required to be identified as $b$ jets. Finally,
discrimination between signal and remaining backgrounds is achieved by
means of additional boosted decision trees.

To validate the techniques used in the search for the 
Higgs boson, the analysis is also interpreted as a measurement
of WZ and ZZ diboson production. The only
modification is in the training of the final discriminants, 
for which a diboson signal is used instead of a Higgs bo- 
son signal. 



DATA AND SIMULATED SAMPLES 

The D0 detector used for Tevatron Run II (2001-2011) is described
in detail in Ref. [?]. Its main components
are: a tracking system surrounding the beam
pipe, followed by a liquid-argon and uranium sampling 
calorimeter, and then a muon system. The tracking sys- 
tem is immersed in a 2 T magnetic field provided by 
a superconducting solenoid and consists of a silicon mi- 
crostrip tracker followed by a scintillating fiber tracker. 
The calorimeter is composed of a central and two end 
sections housed in separate cryostats. Each section is 
segmented in depth, with four electromagnetic layers fol- 
lowed by up to five hadronic layers. Scintillating tiles 
provide additional sampling between the cryostats. The 
muon system consists of tracking and trigger detectors 
in front of and beyond 1.8 T iron toroids. Online event 
selection is provided by a three-level trigger system. 

The data used in this analysis were recorded using triggers designed to
select events with jets and $\not\!\!E_T$ [?]. After imposing data quality
requirements, the total integrated luminosity recorded with these triggers
is 9.5 fb$^{-1}$, corresponding to all available Run II data for this
analysis.

The analysis relies on (i) charged particle tracks, (ii) calorimeter jets
reconstructed in a cone of radius 0.5 in $y$-$\phi$ space, where $y$ is the
rapidity and $\phi$ the azimuthal angle, using the iterative midpoint cone
algorithm [15], and (iii) electrons or muons identified through the
association of tracks with electromagnetic calorimeter clusters or with
hits in the muon detector, respectively. The $\not\!\!E_T$ is reconstructed as
the negative of the vectorial sum of the transverse components of energy
deposits in the calorimeter and is corrected for identified muons. Jet
energies are calibrated using primarily transverse energy balance in
photon+jet events [16], and these corrections are propagated to the
$\not\!\!E_T$ assessment.
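As an illustration of the missing transverse energy definition above, the following minimal Python sketch forms the negative vector sum of transverse energy deposits and applies a simplified muon correction. The function name `missing_et`, the (E_T, phi) input format, and the muon treatment (subtracting the full muon transverse momentum and ignoring the small energy a muon leaves in the calorimeter) are assumptions made for this example; it is not the D0 reconstruction code.

```python
import math


def missing_et(deposits, muons=()):
    """Return (magnitude, phi) of the missing transverse energy.

    deposits: iterable of (et, phi) calorimeter deposits, et in GeV.
    muons:    iterable of (pt, phi) identified muons, pt in GeV.
    Both input formats are hypothetical, chosen for this sketch.
    """
    # Negative of the vector sum of the transverse energy deposits.
    mex = -sum(et * math.cos(phi) for et, phi in deposits)
    mey = -sum(et * math.sin(phi) for et, phi in deposits)
    # Simplified muon correction: muons are nearly invisible to the
    # calorimeter, so their transverse momenta are subtracted as well.
    mex -= sum(pt * math.cos(phi) for pt, phi in muons)
    mey -= sum(pt * math.sin(phi) for pt, phi in muons)
    return math.hypot(mex, mey), math.atan2(mey, mex)
```

For a single 50 GeV deposit at phi = 0, the sketch returns a 50 GeV imbalance pointing in the opposite direction; two equal back-to-back deposits give no imbalance.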



Those backgrounds arising from MJ processes with instrumental effects
giving rise to $\not\!\!E_T$ are estimated from data. The remainder of the
backgrounds and the signal processes are simulated by Monte Carlo (MC).
Events from (W/Z)+jets processes are generated with ALPGEN, interfaced
with PYTHIA [11] for initial- and final-state radiation and for
hadronization. The $p_T$ spectrum of the Z boson is reweighted to match the
D0 measurement [3]. The $p_T$ spectrum of the W boson is reweighted using
the same experimental input, corrected for the differences between the W
and Z $p_T$ spectra predicted in next-to-next-to-leading order (NNLO)
QCD [23]. To simulate $t\bar{t}$ and electroweak single top quark
production, the ALPGEN and SINGLETOP [21] generators, respectively, are
interfaced with PYTHIA, while vector boson pair production is generated
with PYTHIA. The ZH and WH signal processes are generated with PYTHIA for
Higgs boson masses from 100 to 150 GeV in 5 GeV steps. All these
simulations use CTEQ6L1 parton distribution functions (PDFs) [?].

The absolute normalizations for (W/Z) inclusive production are obtained
from NNLO calculations of total cross sections [23] using the MSTW2008
NNLO PDFs [24]. The heavy-flavor fractions in (W/Z)+jets are obtained
using MCFM [25] at next-to-leading order (NLO). The diboson cross sections
are also calculated with MCFM [27]. Cross sections for pair and single top
quark production are taken from Ref. [?]. For signal processes, cross
sections are taken from Ref. [28].

Signal and background samples are passed through a full GEANT3-based
simulation [?] of the detector response and processed with the same
reconstruction program as used for data. Events from randomly selected
beam crossings with the same instantaneous luminosity distribution as data
are overlaid on simulated events to account for detector noise and
contributions from additional $p\bar{p}$ interactions. Parameterizations of
the trigger
efficiencies are determined using events collected with in- 
dependent triggers based on information from the muon 
detectors. Corrections for residual differences between 
data and simulation are applied for electron, muon, and 
jet identification. Jet energy calibration and resolution 
are adjusted in simulated events to match those measured 
in data. 



EVENT SELECTION 

A preselection that greatly reduces the overwhelm- 
ing background from multijet events is performed as 
follows. The interaction vertex must be reconstructed 
within the acceptance of the silicon vertex detector and 
at least three tracks must originate from that vertex. 
Jets with associated tracks that meet criteria ensuring that the
$b$-tagging algorithm operates efficiently are denoted as "taggable" jets,
except for those also identified as hadronic decays of $\tau$ leptons [30].
Exactly two taggable jets are required, one of which must be the leading
(highest $p_T$) jet in the event; the Higgs candidate is formed from these
two jets, denoted jet$_1$ and jet$_2$ (ordered in decreasing $p_T$). These
jets must have transverse momentum $p_T > 20$ GeV and pseudorapidity
$|\eta| < 2.5$. The two taggable jets must not be back-to-back in the plane
transverse to the beam direction:
$\Delta\phi(\mathrm{jet}_1, \mathrm{jet}_2) < 165^\circ$.
Finally, $\not\!\!E_T > 40$ GeV is required.
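The preselection can be summarized as a sequence of cuts. The sketch below is a hedged illustration only: the function names, the dictionary-based jet format, and the `leading_jet_is_taggable` flag are hypothetical, and the requirement that the vertex lie within the silicon detector acceptance is not modelled.

```python
import math


def delta_phi(phi1, phi2):
    """Azimuthal separation folded into [0, pi]."""
    dphi = abs(phi1 - phi2) % (2.0 * math.pi)
    return 2.0 * math.pi - dphi if dphi > math.pi else dphi


def passes_preselection(taggable_jets, leading_jet_is_taggable, met,
                        n_vertex_tracks):
    """Apply the kinematic preselection quoted in the text.

    taggable_jets: list of dicts with 'pt' (GeV), 'eta', 'phi' (radians).
    leading_jet_is_taggable: True if the highest-pT jet is taggable.
    met: missing transverse energy in GeV.
    """
    if n_vertex_tracks < 3:
        return False
    if len(taggable_jets) != 2 or not leading_jet_is_taggable:
        return False
    if any(j["pt"] <= 20.0 or abs(j["eta"]) >= 2.5 for j in taggable_jets):
        return False
    jet1, jet2 = taggable_jets
    # Reject back-to-back dijet topologies, typical of multijet events.
    if delta_phi(jet1["phi"], jet2["phi"]) >= math.radians(165.0):
        return False
    return met > 40.0
```

Ordering the cheap multiplicity checks before the angular cut is a design choice of this sketch, not something stated in the text.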

Additional selection criteria define four distinct sam- 
ples: (i) an "analysis" sample used to search for a Higgs 
boson signal; (ii) an "electroweak (EW) control" sample
used to validate the background MC simulation, enriched
in $W(\to \mu\nu)$+jets events where the jet system has
a topology similar to that of the analysis sample; (iii) an 
"MJ-model" sample, dominated by multijet events, used 
to model the MJ background in the analysis sample; and 
(iv) a large "MJ-enriched" sample, used to validate this 
MJ-modeling procedure. 

The analysis sample is selected by requiring the scalar sum of the
transverse momenta of the two leading taggable jets to be greater than
80 GeV and a measure of the $\not\!\!E_T$ significance $\mathcal{S} > 5$ [?].
Larger values of $\mathcal{S}$ correspond to $\not\!\!E_T$ values that are less
likely to be caused by fluctuations in jet energies. The $\mathcal{S}$
distribution is shown for the analysis sample in Fig. 1.

FIG. 1: (Color online.) The measure $\mathcal{S}$ of $\not\!\!E_T$ significance
in the analysis sample without the requirement that $\mathcal{S}$ be larger
than 5. The data are shown as points with error bars and the background
contributions as histograms: dibosons are labeled as "VV," "V+l.f."
includes (W/Z)+(u, d, s, g) jets, "V+h.f." includes (W/Z)+(b, c) jets, and
"Top" includes pair and single top quark production. The distribution for
signal (VH) is scaled by a factor of 500 and includes ZH and WH production
for $M_H = 125$ GeV.

The dominant signal topology is a pair of $b$ jets recoiling against the
$\not\!\!E_T$ due to the neutrinos from $Z \to \nu\bar{\nu}$ decay, leading to
the direction of the $\not\!\!E_T$ being at a large angle with respect to the
direction of each jet. In contrast, in events from MJ background with
fluctuations in jet energy measurement, the $\not\!\!E_T$ tends to be aligned
with a mismeasured jet. A second estimate of the $\not\!\!E_T$ can be obtained
from the missing $p_T$, $\not\!p_T$, calculated from the reconstructed charged
particle tracks originating from the interaction vertex. This variable is
less sensitive to jet energy measurement fluctuations. In signal events,
$\not\!p_T$ is also expected to point away from both jets, while for MJ
background, its angular distribution is expected to be more isotropic.
Advantage is taken of these features through the variable
$\mathcal{D} = [\Delta\phi(\not\!p_T, \mathrm{jet}_1) + \Delta\phi(\not\!p_T, \mathrm{jet}_2)]/2$.
For signal events, as well as for the non-MJ backgrounds,
$\mathcal{D} > \pi/2$ in the vast majority of events, whereas the MJ
background events tend to be symmetrically distributed around $\pi/2$. In
the analysis sample, $\mathcal{D} > \pi/2$ is therefore required. To improve
the efficiency of this criterion for the $(W \to \mu\nu)H$ signal with
non-identified muons, tracks satisfying isolation criteria are removed
from the $\not\!p_T$ computation. The reverse of the $\mathcal{D}$ requirement
is also used to define the MJ-model sample, as explained below.
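The angular variable defined above, the average azimuthal separation between the track-based missing transverse momentum and each of the two jets, can be sketched as follows. The function names are hypothetical and angles are assumed to be in radians.

```python
import math


def delta_phi(phi1, phi2):
    """Azimuthal separation folded into [0, pi]."""
    dphi = abs(phi1 - phi2) % (2.0 * math.pi)
    return 2.0 * math.pi - dphi if dphi > math.pi else dphi


def d_variable(track_met_phi, jet1_phi, jet2_phi):
    """Average of the azimuthal angles between the track-based missing pT
    and each of the two taggable jets."""
    return 0.5 * (delta_phi(track_met_phi, jet1_phi)
                  + delta_phi(track_met_phi, jet2_phi))


def in_analysis_region(track_met_phi, jet1_phi, jet2_phi):
    """True for the analysis sample (value above pi/2); the reversed
    requirement selects the MJ-model sample."""
    return d_variable(track_met_phi, jet1_phi, jet2_phi) > math.pi / 2.0
```

A signal-like event with the missing momentum recoiling against both jets gives a value well above pi/2, while an event whose missing momentum lies along one jet tends to fall below it.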

Events containing an isolated electron or muon with $p_T > 15$ GeV are
rejected to ensure there is no overlap with the D0 WH search in the
lepton+$\not\!\!E_T$ topology [32].

The EW control sample is selected in a similar manner to the analysis
sample, except that an isolated muon with $p_T > 15$ GeV is required. The
multijet content of this sample is rendered negligible by requiring that
the transverse mass of the muon and $\not\!\!E_T$ system is larger than 30 GeV
and that the $\not\!\!E_T$, calculated taking account of the muon from the W
boson decay, is greater than 20 GeV. To ensure similar jet topologies for
the analysis and EW control samples, the $\not\!\!E_T$, not corrected for the
selected muon, is required to exceed 40 GeV. The number of selected events
is in good agreement with the SM expectation. All the kinematic
distributions are also well described once reweightings of the
distributions of $\Delta\eta(\mathrm{jet}_1, \mathrm{jet}_2)$ and
$\eta(\mathrm{jet}_2)$ are performed, as suggested by a comparison [?] of data
with a simulation of (W/Z)+jets using the SHERPA generator [34]. The
distribution of the dijet mass in the EW control sample is shown in
Fig. 2(a).
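The muon requirements of the EW control sample can be illustrated with the usual transverse mass of a lepton and missing transverse energy, M_T = sqrt(2 p_T E_T^miss (1 - cos(delta phi))). This formula is the standard definition and is assumed here, since the text does not spell it out; the function names and the choice of the muon-corrected missing energy in the transverse mass are likewise assumptions of this sketch.

```python
import math


def transverse_mass(mu_pt, mu_phi, met, met_phi):
    """Standard lepton + missing-ET transverse mass in GeV (assumed form)."""
    return math.sqrt(2.0 * mu_pt * met * (1.0 - math.cos(mu_phi - met_phi)))


def passes_ew_control_muon_cuts(mu_pt, mu_phi, met_with_muon, met_phi,
                                met_without_muon):
    """Muon-related cuts quoted for the EW control sample.

    met_with_muon:    missing ET corrected for the selected muon (GeV).
    met_without_muon: missing ET not corrected for the muon (GeV).
    """
    if mu_pt <= 15.0:
        return False
    if transverse_mass(mu_pt, mu_phi, met_with_muon, met_phi) <= 30.0:
        return False
    return met_with_muon > 20.0 and met_without_muon > 40.0
```

A 40 GeV muon back-to-back with 40 GeV of missing energy gives a transverse mass of 80 GeV, close to the W boson mass, as expected for W decays at rest in the transverse plane.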

The MJ-model sample, used to determine the MJ background, is selected in
the same manner as the analysis sample, except that the requirement
$\mathcal{D} > \pi/2$ is reversed. The small remaining contributions from
non-MJ SM background processes in the $\mathcal{D} < \pi/2$ region are
subtracted, and the resulting sample is used to model the MJ background in
the analysis sample. The MJ background in the region $\mathcal{D} > \pi/2$ is
normalized by performing a fit of the sum of the MJ and SM backgrounds to
the $\not\!\!E_T$ distribution of the data in the analysis sample.

The MJ-enriched sample is used to test the validity of this approach and
is defined in the same manner as the analysis sample, except that
$\mathcal{S} < 4.5$ is now required (see Fig. 1). As a result, the MJ
background dominates the entire range of $\mathcal{D}$ values, and this
sample is used to verify that the events with $\mathcal{D} < \pi/2$ correctly
model those with $\mathcal{D} > \pi/2$. The distribution of the dijet mass in
the MJ-enriched sample is shown in Fig. 2(b).

FIG. 2: (Color online.) Distributions of the dijet mass before $b$-tagging
in the (a) EW control and (b) MJ-enriched samples. The data are shown as
points with error bars and the background contributions as histograms:
dibosons are labeled as "VV," "V+l.f." includes (W/Z)+(u, d, s, g) jets,
"V+h.f." includes (W/Z)+(b, c) jets, and "Top" includes pair and single
top quark production.

A multivariate $b$-tagging discriminant, with several boosted decision
trees as inputs, is used to select events with one or more $b$ quark
candidates. This algorithm is an upgraded version of the neural network
$b$-tagging technique described in Ref. [35]. The new algorithm includes
more information related to the lifetime of the jet and results in a
better discrimination between $b$ and light ($u$, $d$, $s$, $g$) jets. It
provides an output between 0 and 1 for each jet, with a value closer to
one indicating a higher probability that the jet originated from a $b$
quark. The output from the algorithm measured in simulated events is
adjusted to match the output measured in dedicated data samples as
described in more detail in Ref. [?]. From this continuous output,
thirteen operating points ($L_b = 0, 1, \ldots, 12$) are defined, with $b$
purity increasing with $L_b$. Jets with $L_b = 0$ are defined as untagged.
The typical per-$b$-jet efficiency and misidentification rate for
light-flavor jets are about 80% (50%) and 10% (1%) for the loosest
non-zero (tightest) $b$-tag operating point, respectively.

To improve the sensitivity of the analysis, two high signal purity samples
are defined from the analysis sample using the variable
$L_{bb} = L_b(\mathrm{jet}_1) + L_b(\mathrm{jet}_2)$. The two samples are
defined as follows: a tight $b$-tag sample with $L_{bb} \geq 18$ and a medium
$b$-tag sample with $11 \leq L_{bb} \leq 17$. The medium $b$-tag sample
contains events with two loosely $b$-tagged jets, as well as events with
one tightly $b$-tagged jet and one untagged jet. The signal-to-background
ratios for a Higgs boson mass of 125 GeV in the pre, medium, and tight
$b$-tag samples, after applying a multijet veto (defined in the next
section), are respectively 0.035%, 0.23%, and 1.00%.
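The assignment of events to the two b-tag samples can be sketched as below. The function name is hypothetical, and the boundaries are taken to be inclusive (tight from 18 upward, medium from 11 to 17), which is an assumption of this sketch chosen so that every integer value of the summed operating point is assigned unambiguously.

```python
def btag_sample(lb_jet1, lb_jet2):
    """Classify an event by the sum of the two jets' b-tag operating points.

    Each operating point is an integer from 0 (untagged) to 12 (tightest).
    Returns 'tight', 'medium', or None for events in neither sample.
    """
    for lb in (lb_jet1, lb_jet2):
        if not 0 <= lb <= 12:
            raise ValueError("operating point must be between 0 and 12")
    lbb = lb_jet1 + lb_jet2
    if lbb >= 18:          # assumed inclusive lower bound
        return "tight"
    if 11 <= lbb <= 17:    # assumed inclusive bounds
        return "medium"
    return None
```

With these boundaries, an event with one jet at the tightest operating point and one untagged jet (12 + 0) falls in the medium sample, consistent with the description in the text.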



ANALYSIS USING DECISION TREES 

A stochastic gradient boosted decision tree (DT) technique is employed, as
implemented in the TMVA package [?], to improve the discrimination between
signal and background processes.

First, an "MJ DT" (multijet-rejection DT) is trained to discriminate
between signal and MJ-model events before $b$-tagging is applied. To avoid
Higgs boson mass dependence at this stage of the analysis, signal events
are not used, and the MJ DT is trained on a sample of (W/Z)+heavy-flavor
jets events instead. Variables that provide some discrimination have been
chosen for the MJ DT, excluding those strongly correlated to the Higgs
mass (such as the dijet mass itself or the
$\Delta R = \sqrt{(\Delta\eta)^2 + (\Delta\phi)^2}$ between jet$_1$ and
jet$_2$). The MJ DT output, which ranges between $-1$ and $+1$, is shown in
Fig. 3 for the analysis sample after the medium $b$-tagging requirement.
Good agreement is seen between data and the predicted background. A value
of the multijet discriminant in excess of $-0.3$ is required (multijet
veto), which removes 93% of the multijet background while retaining 90% of
the signal for $M_H = 125$ GeV. The numbers of expected signal and
background events, as well as the number of observed events, are given in
Table I after imposing the multijet veto. Dijet mass distributions in the
analysis sample after the multijet veto are shown in Fig. 4 for
$b$-tagged events.
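The yields quoted in Table I after the multijet veto can be used to cross-check the signal-to-background ratios stated for the pre, medium, and tight b-tag samples. The short sketch below uses the central values from the table (ZH and WH signal yields and total background); the variable and function names are illustrative only.

```python
# Central values from Table I: (ZH yield, WH yield, total background).
TABLE_I = {
    "pre": (18.3, 16.7, 99535.0),
    "medium": (6.7, 6.1, 5462.0),
    "tight": (6.0, 5.3, 1134.0),
}


def signal_to_background_percent(zh, wh, background):
    """Combined ZH + WH signal over total background, in percent."""
    return 100.0 * (zh + wh) / background


ratios = {name: signal_to_background_percent(*row)
          for name, row in TABLE_I.items()}

for name, value in ratios.items():
    print(f"{name:>6} b-tag: S/B = {value:.3f}%")
```

The resulting values, roughly 0.035%, 0.23%, and 1.00%, agree with the ratios quoted in the text.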

Next, to separate signal from the remaining SM backgrounds, two "SM DTs"
(SM-background-rejection DTs) are trained for each $M_H$, one in the medium
$b$-tag channel and one in the tight $b$-tag channel. Some of the MJ DT
input variables are used again, but most of the discrimination comes from
additional kinematic variables correlated to the Higgs boson mass, of
which, as expected, the dijet mass has the strongest discriminating power.
The SM DT outputs, which range between $-1$ and $+1$, are used as final
discriminants. Their distributions are






TABLE I: The numbers of expected signal, expected background, and observed data events after the multijet veto, for the
pre, medium, and tight $b$-tag samples. The signal corresponds to $M_H = 125$ GeV, "Top" includes pair and single top quark
production, and "VV" is the sum of all diboson processes. The uncertainties quoted on the signal and total background arise
from the statistics of the simulation and from the sources of systematic uncertainties mentioned in the text.

Sample         ZH           WH           W+jets   Z+jets   Top    VV     MJ     Total background   Observed
Pre b-tag      18.3 ± 1.8   16.7 ± 1.6   66895    25585    1934   3144   1977   99535 ± 12542      98980
Medium b-tag    6.7 ± 0.7    6.1 ± 0.6    3112     1074     761    237    278    5462 ± 776         5453
Tight b-tag     6.0 ± 0.8    5.3 ± 0.7     443      252     377     56      6    1134 ± 192         1039

FIG. 3: (Color online.) Distribution of the MJ DT output after the medium
$b$-tagging requirement in the analysis sample. The distribution for signal
(VH), shown for $M_H = 125$ GeV, is scaled by a factor of 100 and includes
ZH and WH production. The data are shown as points with error bars and the
background contributions as histograms: dibosons are labeled as "VV,"
"V+l.f." includes (W/Z)+(u, d, s, g) jets, "V+h.f." includes (W/Z)+(b, c)
jets, and "Top" includes pair and single top quark production.



shown in Fig. 5 for $M_H = 125$ GeV.



SYSTEMATIC UNCERTAINTIES 

Experimental uncertainties arise from the integrated luminosity (6%) [?],
the trigger simulation (2%), the jet energy calibration and resolution
[(1-2)%], jet reconstruction and taggability (3%), the lepton
identification (1%), the modeling of the MJ background (25%, which
translates into a 1% uncertainty on the total background), and the
$b$-tagging (from 4% for background in the medium $b$-tag sample to 9% for
signal in the tight $b$-tag sample). In addition to the impact of these
uncertainties on the integrated signal and background yields mentioned
above, modifications of the shapes of the final discriminants are also
considered, when relevant. Correlations among systematic uncertainties in
signal and each background are taken into account when extracting the
final results.

Theoretical uncertainties on cross sections for SM processes are estimated
as follows. For (W/Z)+jets production, an uncertainty of 10% is assigned
to the total cross sections and an uncertainty of 20% to the heavy-flavor
fractions (estimated using MCFM at NLO [25]). For other SM backgrounds,
uncertainties are taken from Ref. [?] or obtained using MCFM and range
from 6% to 10%. The uncertainties on cross sections for signal (7%) are
taken from Ref. [?]. Uncertainties on the shapes of the final
discriminants arise from (i) the modeling of (W/Z)+jets, assessed by
varying the renormalization and factorization scales and by comparing
results from ALPGEN interfaced with HERWIG [?] to ALPGEN interfaced with
PYTHIA, and (ii) the choice of PDFs, estimated using the prescription of
Ref. [22].

FIG. 4: (Color online.) Dijet invariant mass in the analysis sample after
the multijet veto for events with (a) medium $b$-tag and (b) tight $b$-tag.
The distributions for signal (VH), which are scaled by a factor of 100 for
medium $b$-tag and 10 for tight $b$-tag, respectively, include ZH and WH
production for $M_H = 125$ GeV. The data are shown as points with error
bars and the background contributions as histograms: dibosons are labeled
as "VV," "V+l.f." includes (W/Z)+(u, d, s, g) jets, "V+h.f." includes
(W/Z)+(b, c) jets, and "Top" includes pair and single top quark production.



LIMIT SETTING PROCEDURE 

Agreement is found between data and the predicted background, both in the
numbers of selected events (Table I) and in the distributions of final
discriminants (Fig. 5), once systematic uncertainties are taken into
account. The modified frequentist CL$_s$ approach [?] is used to set limits
on the cross section for SM Higgs boson production, where the test
statistic is a log-likelihood ratio (LLR) for the background-only and
signal+background hypotheses. The result is obtained by summing LLR values
over the bins in the final discriminants shown in Fig. 5. The impact of
systematic uncertainties on the sensitivity of the analysis is reduced by
maximizing a "profile" likelihood function [?] in which these
uncertainties are given Gaussian constraints associated with their priors.
Figure 6 shows a comparison of the SM DT distributions expected for a
signal with $M_H = 125$ GeV and observed for the background-subtracted
data. The subtracted background and its uncertainties are the result of
the profile likelihood fit to the data under the background-only
hypothesis.
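The bin-by-bin sum of LLR values can be illustrated with a simplified Poisson likelihood ratio, LLR = -2 ln[L(s+b)/L(b)] = 2 sum_i [s_i - d_i ln(1 + s_i/b_i)], where s_i, b_i, and d_i are the expected signal, expected background, and observed counts in bin i. This sketch is an assumption-laden simplification: it treats each bin as an independent Poisson count and omits the systematic uncertainties and the profiling described above. The function name is hypothetical.

```python
import math


def binned_llr(signal, background, data):
    """Simplified binned Poisson log-likelihood ratio.

    Returns -2 ln [L(signal+background) / L(background-only)] summed over
    bins, without nuisance parameters. Positive values are more
    background-like, negative values more signal-like.
    """
    total = 0.0
    for s, b, d in zip(signal, background, data):
        if b <= 0.0:
            raise ValueError("expected background must be positive in every bin")
        # The factorial terms of the two Poisson likelihoods cancel.
        total += 2.0 * (s - d * math.log(1.0 + s / b))
    return total
```

With data equal to the background expectation the value is positive, and with data equal to signal plus background it is negative, which is the sign convention implied by comparing the observed LLR with the background-only and signal+background expectations.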

FIG. 5: (Color online.) The SM DT output for the (W/Z)H search with
$M_H = 125$ GeV following the multijet veto for events with (a) medium
$b$-tag and (b) tight $b$-tag prior to the fit to data. The distributions
for signal (VH) are scaled by a factor of 100 for medium $b$-tag events and
10 for tight $b$-tag events, respectively, and include ZH and WH production
for $M_H = 125$ GeV. The data are shown as points with error bars and the
background contributions as histograms: dibosons are labeled as "VV,"
"V+l.f." includes (W/Z)+(u, d, s, g) jets, "V+h.f." includes (W/Z)+(b, c)
jets, and "Top" includes pair and single top quark production.



FIG. 6: (Color online.) Final discriminant distributions expected for a SM
VH signal with $M_H = 125$ GeV (filled histogram) and observed for
background-subtracted data (points with statistical error bars) for the
(a) medium and (b) tight $b$-tag channels. The subtracted background is the
result of the profile likelihood fit to the data under the background-only
hypothesis. Also shown is the ±1 standard deviation (s.d.) band on the
fitted background. No scaling factor is applied to the signal.

HIGGS BOSON SEARCH RESULTS

The results are given as limits in Table II and Fig. 7(a) and in terms of
LLR values in Fig. 7(b). For $M_H = 125$ GeV, the observed and expected
limits on the combined cross section of ZH and WH production are factors
of 4.3 and 3.9 larger than the SM value, respectively, assuming SM
branching fractions. In Fig. 7(b), the median expected LLR in the presence
of a Higgs boson with a mass of 125 GeV is also shown for comparison.

FIG. 7: (Color online.) (a) Ratio of the observed (solid black) and
expected (dotted red) exclusion limits to the SM production cross section.
(b) The observed (solid black) and expected LLRs for the background-only
(black dots) and signal+background hypotheses (short red dashes), as well
as the LLR expected in the presence of a Higgs boson with $M_H = 125$ GeV
(long blue dashes). All are shown as a function of the tested value of
$M_H$, with the green and yellow shaded areas corresponding to the 1 and 2
standard deviation (s.d.) variations around the background-only
hypothesis.



DIBOSON SEARCH RESULTS 

The final states arising from the SM production of
$(Z \to \nu\bar{\nu})(Z \to b\bar{b})$ and $(W \to \ell\nu)(Z \to b\bar{b})$ are
the same in particle content and topology as those used for the Higgs
boson search reported above when the lepton from $W \to \ell\nu$ is not
reconstructed. Evidence for ZZ and WZ production can therefore be used to
validate the techniques employed in the Higgs boson search. The only
modification to the analysis is in the training of the final
discriminants, where ZZ and WZ are now treated as signal with the
remaining diboson process, WW, kept as background. A cross section scale
factor of 0.94 ± 0.31 (stat) ± 0.34 (syst) is measured with respect to the
predicted SM value of (4.4 ± 0.3) pb [13], with an observed (expected)
significance of 2.0 (2.1) standard deviations.
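As a rough arithmetic check of these numbers, the statistical and systematic uncertainties on the scale factor can be combined in quadrature and the scale factor converted into a cross section. This is only a back-of-the-envelope sketch: the quoted significance comes from the likelihood-based procedure, not from the simple ratio computed here, and treating the two uncertainties as independent Gaussians is an assumption.

```python
import math

# Values quoted in the text.
scale_factor = 0.94      # measured / predicted cross section
stat_unc = 0.31
syst_unc = 0.34
sm_cross_section_pb = 4.4

# Quadrature sum, assuming independent Gaussian uncertainties.
total_unc = math.hypot(stat_unc, syst_unc)            # about 0.46

# Measured cross section implied by the scale factor.
measured_cross_section_pb = scale_factor * sm_cross_section_pb   # about 4.1 pb

# Naive estimate of the distance from zero signal, in standard deviations.
naive_significance = scale_factor / total_unc         # about 2.0

print(f"total uncertainty on scale factor: {total_unc:.2f}")
print(f"measured cross section: {measured_cross_section_pb:.2f} pb")
print(f"naive significance: {naive_significance:.1f} s.d.")
```

The naive estimate of about two standard deviations is consistent with the observed significance quoted above.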

The measurement of the diboson cross section has also 
been carried out using as final discriminants the distribu- 
tions of dijet invariant mass (as opposed to the SM DTs) 
in the medium and tight $b$-tag samples. A cross section
scale factor of 1.08 ± 0.35 (stat) ± 0.39 (syst) is measured 
with respect to the predicted SM value, with an observed 
(expected) significance of 2.0 (1.9) standard deviations. 
The expected significance is slightly lower than the one 
expected with the multivariate analysis, in which addi- 
tional discrimination is provided by variables such as the 
angular separation between jets or the event centrality. 

Figure 8 shows the final discriminant distributions in the medium and
tight $b$-tag channels, as well as the dijet mass distribution summed over
the medium and tight $b$-tag channels, for the expected WZ + ZZ signal and
for the background-subtracted data. The subtracted backgrounds and their
uncertainties are the results of the profile likelihood fits to the data
under the signal+background hypothesis.



SUMMARY 

We have performed a search for the standard model Higgs boson in
9.5 fb$^{-1}$ of $p\bar{p}$ collisions at $\sqrt{s} = 1.96$ TeV collected with
the D0 detector at the Fermilab Tevatron Collider. The final state
considered contains a pair of $b$ jets and is characterized by an imbalance
in transverse energy, as expected from
$p\bar{p} \to ZH \to \nu\bar{\nu}b\bar{b}$ production and decays. The search
is also sensitive to the $WH \to \ell\nu b\bar{b}$ channel when the charged
lepton is not identified. The data are found to be in good agreement with
the expected background. For a Higgs boson mass of 125 GeV, we set a limit
at the 95% C.L. on the cross section $\sigma(p\bar{p} \to [Z/W]H)$, assuming
standard model branching fractions, that is a factor of 4.3 larger than
the theoretical standard model value, for an expected factor of 3.9.

To validate our analysis techniques, we also performed a search for WZ and
ZZ production, resulting in a measurement of the combined cross section
that is a factor of 0.94 ± 0.31 (stat) ± 0.34 (syst) relative to the
standard model prediction, with a significance of 2.0 standard deviations.

TABLE II: The expected and observed upper limits measured using 9.5 fb$^{-1}$ of integrated luminosity on the ZH plus WH
production cross section relative to the SM expectation, assuming SM branching fractions, as a function of $M_H$.

FIG. 8: (Color online.) Final discriminant distributions expected for a WZ
plus ZZ signal (filled histogram) and observed for background-subtracted
data (points with statistical error bars) for the (a) medium and (b) tight
$b$-tag channels. (c) Similarly for the dijet mass distribution, summed
over the medium and tight $b$-tag channels. The subtracted backgrounds are
the results of profile likelihood fits to the data under the
signal+background hypothesis. Also shown are the ±1 standard deviation
(s.d.) bands on the fitted backgrounds. The signal is scaled to the SM
cross section.

Supplementary material

Introduction 

No added material. 

Data and Simulated Samples 

No added material. 

Event selection 

The $\mathcal{S}$ distribution is shown for the analysis and EW-control
samples in Fig. [?].

The effectiveness of the use of the variable
$\mathcal{D} = [\Delta\phi(\not\!p_T, \mathrm{jet}_1) + \Delta\phi(\not\!p_T, \mathrm{jet}_2)]/2$
can be seen in Fig. [?], where the distribution of $\mathcal{D}$ is shown for
the EW control sample, dominated by events with real $\not\!\!E_T$, and for
the MJ-enriched sample, dominated by events with $\not\!\!E_T$ arising from
instrumental effects. For signal events, as well as for the non-MJ
backgrounds, it is expected that $\mathcal{D} > \pi/2$ in the vast majority of
events, whereas the MJ background events tend to be symmetrically
distributed around $\pi/2$. In the analysis sample, $\mathcal{D} > \pi/2$ is
therefore required.

The distributions for the pre $b$-tag sample of the dijet $\Delta R$ and
$\not\!\!H_T/H_T$ (defined in Table III), and for the dijet invariant mass for
the medium $b$-tag and tight $b$-tag samples, are shown in Fig. [?] for the
EW-control sample and in Fig. [?] for the MJ-enriched sample.

Analysis using decision trees 

The full list of the seventeen input variables to the MJ DT is given in
Table III.

The MJ DT output is shown for the analysis and EW control samples after
the medium $b$-tagging requirement in Fig. [?].

The distributions for the dijet invariant mass, missing $E_T$, dijet
$\Delta R$, and the $b$-tagging discriminating variable ($L_{bb}$) are shown in
Fig. [?] for the analysis sample after the multijet veto and before any
$b$-tagging requirement.

The full list of variables used in the SM DT is shown in Table III.



Systematic uncertainties 

Systematic uncertainties are summarized in Table IV. The numbers quoted
are uncertainties on total yields. The background cross sections entry
represents the global effect of cross section uncertainties on the sum of
backgrounds. There is no luminosity uncertainty associated with the
multijet normalization since it comes from real data. The multijet
contribution is a non-negligible background component in the medium
$b$-tag sample.

In addition to the impact of these uncertainties on the integrated signal
and background yields, modifications of the shapes of the final
discriminants are also considered, when relevant. These originate mainly
from jet corrections (energy scale, resolution, and $b$-tagging) and also
have small contributions from Monte Carlo reweightings and from parton
distribution function variations.

Limit setting procedure 

Figure 15 shows, for $m_H = 125$ GeV, the SM DT distributions
after profiling. In this case, the background prediction
and its uncertainties have been determined from
the fit to data under the background-only hypothesis.

Higgs Search Results 

No added material. 

Diboson Search Results 

The medium and tight b-tag SM DTs are shown in
Fig. 16.

Figure 17 shows the SM DT distributions. The background
prediction and its uncertainties have been determined
from a fit to the data under the signal+background
hypothesis.

Figure 18 shows the dijet invariant mass distributions,
along with the background-subtracted data. The background
prediction and its uncertainties have been determined
from a fit to the data under the signal+background
hypothesis.

Summary 

No added material. 
TABLE III: Variables used as input to the decision trees, where the angles $\theta$ and $\phi$ are the polar and azimuthal angles defined
with respect to the proton beam direction, jet$_1$ refers to the leading taggable jet, jet$_2$ refers to the next-to-leading taggable
jet, and jet$_{\rm all}$ refers to any jet in the event with $p_T > 15$ GeV and pseudorapidity $|\eta| < 3.2$, without the taggability requirement.
The thrust axis is the direction obtained from the difference of the transverse momenta of the leading and next-to-leading jets.
The recoil is defined in the plane transverse to the beam using either (i) the amount of missing transverse energy that remains
after removal of the two leading jets, or (ii) the sum of all good jet transverse momenta in the half plane opposite to the one
containing the dijet system (with respect to the thrust axis). Of these two possible recoil definitions, the one that has the
larger component along the direction orthogonal to the thrust is chosen.



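Variables used in the MJ DT

$\Delta\phi(\text{jet}_1, \text{jet}_2)$
$\eta$ of jet$_1$
$\not\!\!E_T$ significance
min $\Delta\phi(\not\!\!E_T, \text{jet}_{\rm all})$
max $\Delta\phi(\not\!\!E_T, \text{jet}_{\rm all})$ + min $\Delta\phi(\not\!\!E_T, \text{jet}_{\rm all})$
max $\Delta\phi(\not\!\!E_T, \text{jet}_{\rm all})$ - min $\Delta\phi(\not\!\!E_T, \text{jet}_{\rm all})$
$\not\!\!H_T$ (vectorial sum of jet$_{\rm all}$ $p_T$)
$\not\!\!H_T/H_T$ (with $H_T$ the scalar sum of jet$_{\rm all}$ $p_T$)
Asymmetry between $\not\!\!E_T$ and $\not\!\!H_T$: $(\not\!\!E_T - \not\!\!H_T)/(\not\!\!E_T + \not\!\!H_T)$
component along the thrust axis
component perpendicular to the thrust axis
Sum of the signed components of the dijet and recoil momenta along the thrust axis
Sum of the signed components of the dijet and recoil momenta perpendicular to the thrust axis
Centrality (ratio of the scalar sum of jet$_1$ and jet$_2$ $p_T$ to the sum of their energies)
$\theta$ angle of the dijet system
Polar angle of jet$_1$ boosted to the dijet rest frame with respect to the dijet direction in the laboratory

Variables used in the SM DT

Dijet mass
Dijet transverse mass
jet$_1$ $p_T$
jet$_2$ $p_T$
Scalar sum of jet$_1$ and jet$_2$ $p_T$
$\eta$ of jet$_1$
$\eta$ of jet$_2$
$\Delta\eta(\text{jet}_1, \text{jet}_2)$
$\Delta\phi(\text{jet}_1, \text{jet}_2)$
$\Delta R(\text{jet}_1, \text{jet}_2)$
$p_T$ weighted $\Delta R(\text{jet}_1, \text{jet}_{\rm all})$
$p_T$ weighted $\Delta R(\text{jet}_2, \text{jet}_{\rm all})$
$H_T$ (scalar sum of jet$_{\rm all}$ $p_T$)
$\not\!\!H_T$ (vectorial sum of jet$_{\rm all}$ $p_T$)
$\not\!\!H_T/H_T$
$\Delta\phi(\not\!\!E_T, \text{dijet})$
$\theta$ angle of jet$_1$ boosted to the dijet rest frame
Polar angle of jet$_1$ boosted to the dijet rest frame with respect to the dijet direction in the laboratory
min $\Delta\phi(\not\!\!E_T, \text{jet}_{\rm all})$
max $\Delta\phi(\not\!\!E_T, \text{jet}_{\rm all})$ + min $\Delta\phi(\not\!\!E_T, \text{jet}_{\rm all})$
Dijet $p_T$
$\Delta\phi(\not\!\!E_T, \text{jet}_1)$
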
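
Several of the inputs listed in Table III follow directly from the jet kinematics. The sketch below is an illustration under stated assumptions, not the analysis code: the `Jet` record and all function names are hypothetical, $\Delta R$ is taken as the usual $\sqrt{\Delta\eta^2 + \Delta\phi^2}$, and the scalar and vectorial $p_T$ sums and the centrality follow the wording of the table.

```python
import math
from typing import NamedTuple, Sequence


class Jet(NamedTuple):
    """Minimal jet record (hypothetical layout for this illustration)."""
    pt: float      # transverse momentum in GeV
    eta: float     # pseudorapidity
    phi: float     # azimuthal angle in radians
    energy: float  # energy in GeV


def delta_phi(phi1: float, phi2: float) -> float:
    """Absolute azimuthal separation, folded into the range [0, pi]."""
    dphi = abs(phi1 - phi2) % (2.0 * math.pi)
    return 2.0 * math.pi - dphi if dphi > math.pi else dphi


def delta_r(a: Jet, b: Jet) -> float:
    """Angular distance sqrt(d_eta^2 + d_phi^2) between two jets."""
    return math.hypot(a.eta - b.eta, delta_phi(a.phi, b.phi))


def scalar_ht(jets: Sequence[Jet]) -> float:
    """Scalar sum of the jet transverse momenta."""
    return sum(j.pt for j in jets)


def missing_ht(jets: Sequence[Jet]) -> float:
    """Magnitude of the vectorial sum of the jet transverse momenta."""
    px = sum(j.pt * math.cos(j.phi) for j in jets)
    py = sum(j.pt * math.sin(j.phi) for j in jets)
    return math.hypot(px, py)


def centrality(jet1: Jet, jet2: Jet) -> float:
    """Ratio of the scalar sum of the two jet pT values to the sum of their energies."""
    return (jet1.pt + jet2.pt) / (jet1.energy + jet2.energy)
```

For two back-to-back jets of equal $p_T$, the vectorial sum vanishes while the scalar sum does not, so their ratio is close to zero; values near one indicate jets recoiling against unreconstructed momentum.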
TABLE IV: Systematic uncertainties, in percent, of the overall signal and background yields. "Jet EC" and "Jet ER" stand for
jet energy calibration and resolution, respectively. "Jet R&T" stands for jet reconstruction and taggability. "Signal" includes
ZH and WH production and is shown for $m_H = 125$ GeV.



Systematic Uncertainty | Signal (%) | Background (%)

Medium b-tag
Jet EC - Jet ER        | ± 0.9      | ± 1.9
Jet R&T                | ± 2.9      | ± 2.9
b Tagging              | ± 0.6      | ± 3.7
Trigger                | ± 2.0      | ± 1.9
Lepton Identification  | ± 0.8      | ± 0.9
Heavy Flavor Fractions | -          | ± 8.5
Cross Sections         | ± 7.0      | ± 9.8
Luminosity             | ± 6.1      | ± 5.8
Multijet Normalization | -          | ± 1.2
Total                  | ± 10.0     | ± 14.2

Tight b-tag
Jet EC - Jet ER        | ± 1.0      | ± 1.8
Jet R&T                | ± 2.7      | ± 3.1
b Tagging              | ± 8.6      | ± 7.4
Trigger                | ± 2.0      | ± 2.0
Lepton Identification  | ± 0.9      | ± 1.1
Heavy Flavor Fractions | -          | ± 11.1
Cross Sections         | ± 7.0      | ± 10.0
Luminosity             | ± 6.1      | ± 6.1
Multijet Normalization | -          | ± 0.1
Total                  | ± 13.2     | ± 16.9
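
As an arithmetic cross-check of Table IV, the quoted signal totals are reproduced by adding the individual signal entries in quadrature. The sketch below assumes the entries are uncorrelated, which the text does not state explicitly; the variable and function names are hypothetical.

```python
import math

# Signal column of Table IV, in percent, in the order listed there:
# Jet EC - Jet ER, Jet R&T, b Tagging, Trigger, Lepton Identification,
# Cross Sections, Luminosity.
SIGNAL_UNCERTAINTIES = {
    "medium": [0.9, 2.9, 0.6, 2.0, 0.8, 7.0, 6.1],
    "tight": [1.0, 2.7, 8.6, 2.0, 0.9, 7.0, 6.1],
}

# Totals quoted in Table IV for the signal, in percent.
QUOTED_SIGNAL_TOTALS = {"medium": 10.0, "tight": 13.2}


def quadrature_sum(values):
    """Combine uncertainties assumed to be uncorrelated: sqrt(sum of squares)."""
    return math.sqrt(sum(v * v for v in values))


for channel, values in SIGNAL_UNCERTAINTIES.items():
    total = quadrature_sum(values)
    print(f"{channel}: {total:.1f}% (quoted {QUOTED_SIGNAL_TOTALS[channel]}%)")
```

The same naive sum over the background column gives roughly 15.3% (medium) and 18.3% (tight), larger than the quoted 14.2% and 16.9%, so the background totals are evidently not a simple quadrature sum of the listed entries. The text does not spell out how they are combined.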




[Figure panels (a) and (b); horizontal axis: Significance.]

FIG. 9: Missing $E_T$ significance in (a) the analysis and (b) the EW-control samples without the requirement that the significance
be larger than 5. The data are shown as points and the background contributions as histograms; dibosons are labeled as "VV,"
"V+l.f." includes $(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets and "Top" includes pair and single top quark
production. In (a), the distribution for signal (VH) is multiplied by a factor of 500 and includes ZH and WH production for
$m_H = 125$ GeV.

[Figure panels: (a) "$ZH \to \nu\nu bb$ EW Control sample (Pre b-tag)", (b) "$ZH \to \nu\nu bb$ Multijet Enriched sample (Pre b-tag)"; horizontal axis in both: (max $\Delta\phi$(Pt Trk, jets) + min $\Delta\phi$(Pt Trk, jets))/2.]

FIG. 10: Distribution of $\mathcal{D}$ in (a) the EW-control sample and (b) the MJ-enriched sample, without the requirement that it
be larger than $\pi/2$. The data are shown as points and the background contributions as histograms: dibosons are labeled as
"VV," "V+l.f." includes $(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets and "Top" includes pair and single top
quark production. In (b), the shaded region ($\mathcal{D} < \pi/2$) is used to model the events in the unshaded region ($\mathcal{D} > \pi/2$); the
dip observed in the region around $\pi/2$ is due to the acoplanarity cut between the Higgs candidate jets. These distributions are
shown before b tagging.




[Figure panels (c) and (d); horizontal axis: DiJet Invariant Mass (GeV).]

FIG. 11: Representative variable distributions in the EW-control sample: (a) dijet $\Delta R$ in the pre b-tag sample, (b) $\not\!\!H_T/H_T$
(defined in Table I) in the pre b-tag sample, (c) dijet invariant mass in the medium b-tag sample, (d) dijet invariant mass in
the tight b-tag sample. The data are shown as points and the background contributions as histograms: dibosons are labeled as
"VV," "V+l.f." includes $(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets and "Top" includes pair and single top
quark production.

[Figure panels (a)-(d); horizontal axes: (a) DiJet $\Delta R$; (c) and (d) DiJet Invariant Mass (GeV).]

FIG. 12: Representative variable distributions in the MJ-enriched sample: (a) dijet $\Delta R$ in the pre b-tag sample, (b) $\not\!\!H_T/H_T$
(defined in Table I) in the pre b-tag sample, (c) dijet invariant mass in the medium b-tag sample, (d) dijet invariant mass in
the tight b-tag sample. The data with $\mathcal{D} > \pi/2$ are shown as points and the background contributions as histograms: dibosons
are labeled as "VV," "V+l.f." includes $(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets and "Top" includes pair
and single top quark production. The "multijet" histogram is obtained from the data with $\mathcal{D} < \pi/2$.




FIG. 13: MJ DT output after the medium b-tagging requirement in the (a) analysis sample and (b) EW-control sample. The
distribution for signal (VH), shown for $m_H = 125$ GeV, is multiplied by a factor of 100 and includes ZH and WH production.
The data are shown as points and the background contributions as histograms: dibosons are labeled as "VV," "V+l.f." includes
$(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets and "Top" includes pair and single top quark production.

FIG. 14: Representative variable distributions in the analysis sample after the multijet veto and before any b tagging requirement:
(a) dijet invariant mass, (b) missing $E_T$, (c) dijet $\Delta R$, (d) b-tagging discriminating variable ($L_{bb}$). The bin at zero is
suppressed in this plot due to the large number of entries, mostly from pairs of light jets. The relatively high number of events
observed at $L_{bb} = 12$ comes mainly from events with one untagged jet and one very tightly b-tagged jet; the bin at $L_{bb} = 24$
comes from events with two very tightly b-tagged jets. The distributions for signal (VH), which are multiplied by a factor of
500 for (a)-(c) and 50 for (d), include ZH and WH production for $m_H = 125$ GeV. The data are shown as points and the
background contributions as histograms: dibosons are labeled as "VV," "V+l.f." includes $(W/Z)+(u,d,s,g)$ jets, "V+h.f."
includes $(W/Z)+(b,c)$ jets and "Top" includes pair and single top quark production.

[Figure panels (a) and (b); horizontal axis: Final Discriminant.]

FIG. 15: The SM DT output, for $m_H = 125$ GeV, following the multijet veto and after the fit to the data under the background-only
hypothesis in the (a) medium and (b) tight b-tag channels. The data are shown as points and the background contributions
as histograms: dibosons are labeled as "VV", "V+l.f." includes $(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets
and "Top" includes pair and single top quark production.



[Figure panels: "VZ Analysis sample (Medium b-tag)" and "VZ Analysis sample (Tight b-tag)".]

FIG. 16: The SM DT output for the WZ and ZZ diboson search following the multijet veto for (a) medium and (b) tight
tag prior to the fit to data. The data are shown as points and the background contributions as histograms; "V+l.f." includes
$(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets and "Top" includes pair and single top quark production. The
WZ and ZZ signal is denoted as VZ. The distributions for signal are scaled to the SM cross section (filled red histogram) and
shown separately multiplied by a factor of 10 for medium b-tag and 5 for tight b-tag (solid red line), respectively.

FIG. 17: The SM DT output for the WZ and ZZ diboson search, following the multijet veto, and after the fit to the data
under the signal+background hypothesis in the (a) medium and (b) tight tag channels. The data are shown as points and the
background contributions as histograms; "V+l.f." includes $(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets and
"Top" includes pair and single top quark production. The WZ and ZZ signal expectation (red histogram, and denoted VZ) is
scaled to the SM cross section.

FIG. 18: The dijet invariant mass for the WZ and ZZ diboson search, following the multijet veto, and after the fit to the data
under the signal+background hypothesis in the (a) medium and (b) tight tag channels. The data are shown as points and the
background contributions as histograms; "V+l.f." includes $(W/Z)+(u,d,s,g)$ jets, "V+h.f." includes $(W/Z)+(b,c)$ jets and
"Top" includes pair and single top quark production. The WZ and ZZ signal expectation (red histogram, and denoted VZ)
and the data after subtracting the fitted background (points) are shown in the (c) medium and (d) tight tag channels. Also
shown is the ±1 standard deviation band on the total background after fitting. The signal is scaled to the SM cross section.



